Semi-supervised disentangled framework for transferable named entity recognition

نویسندگان

چکیده

Named entity recognition (NER) for identifying proper nouns in unstructured text is one of the most important and fundamental tasks natural language processing. However, despite widespread use NER models, they still require a large-scale labeled data set, which incurs heavy burden due to manual annotation. Domain adaptation promising solutions this problem, where rich from relevant source domain are utilized strengthen generalizability model based on target domain. mainstream cross-domain models affected by following two challenges (1) Extracting domain-invariant information such as syntactic transfer. (2) Integrating domain-specific semantic into improve performance NER. In study, we present semi-supervised framework transferable NER, disentangles latent variables variables. proposed framework, integrated with using predictor. The disentangled three mutual regularization terms, i.e., maximizing between original embedding, minimizing Extensive experiments demonstrated that our can obtain state-of-the-art cross-lingual benchmark sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semi-supervised Bootstrapping approach for Named Entity Recognition

The aim of Named Entity Recognition (NER) is to identify references of named entities in unstructured documents, and to classify them into pre-defined semantic categories. NER often aids from added background knowledge in the form of gazetteers. However using such a collection does not deal with name variants and cannot resolve ambiguities associated in identifying the entities in context and a...

متن کامل

A Simple Semi-supervised Algorithm For Named Entity Recognition

We present a simple semi-supervised learning algorithm for named entity recognition (NER) using conditional random fields (CRFs). The algorithm is based on exploiting evidence that is independent from the features used for a classifier, which provides high-precision labels to unlabeled data. Such independent evidence is used to automatically extract highaccuracy and non-redundant data, leading ...

متن کامل

Robust Multilingual Named Entity Recognition with Shallow Semi-Supervised Features

We present a multilingual Named Entity Recognition approach based on a robust and general set of features across languages and datasets. Our system combines shallow local information with clustering semi-supervised features induced on large amounts of unlabeled text. Understanding via empirical experimentation how to effectively combine various types of clustering features allows us to seamless...

متن کامل

A Semi-supervised Learning Approach to Arabic Named Entity Recognition

We present ASemiNER, a semisupervised algorithm for identifying Named Entities (NEs) in Arabic text. ASemiNER does not require annotated training data, or gazetteers. It also can be easily adapted to handle more than the three standard NE types (Person, Location, and Organisation). To our knowledge, our algorithm is the first study that intensively investigates the semi-supervised pattern-based...

متن کامل

Semi-supervised Named Entity Recognition in noisy-text

Many of the existing Named Entity Recognition (NER) solutions are built based on news corpus data with proper syntax. These solutions might not lead to highly accurate results when being applied to noisy, user generated data, e.g., tweets, which can feature sloppy spelling, concept drift, and limited contextualization of terms and concepts due to length constraints. The models described in this...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neural Networks

سال: 2021

ISSN: ['1879-2782', '0893-6080']

DOI: https://doi.org/10.1016/j.neunet.2020.11.017